On the practice of error analysis for machine translation evaluation

نویسندگان

  • Sara Stymne
  • Lars Ahrenberg
چکیده

Error analysis is a means to assess machine translation output in qualitative terms, which can be used as a basis for the generation of error profiles for different systems. As for other subjective approaches to evaluation it runs the risk of low inter-annotator agreement, but very often in papers applying error analysis to MT, this aspect is not even discussed. In this paper, we report results from a comparative evaluation of two systems where agreement initially was low, and discuss the different ways we used to improve it. We compared the effects of using more or less fine-grained taxonomies, and the possibility to restrict analysis to short sentences only. We report results on inter-annotator agreement before and after measures were taken, on error categories that are most likely to be confused, and on the possibility to establish error profiles also in the absence of a high inter-annotator agreement.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Correlation of Machine Translation Evaluation Metrics with Human Judgement on Persian Language

Machine Translation Evaluation Metrics (MTEMs) are the central core of Machine Translation (MT) engines as they are developed based on frequent evaluation. Although MTEMs are widespread today, their validity and quality for many languages is still under question. The aim of this research study was to examine the validity and assess the quality of MTEMs from Lexical Similarity set on machine tra...

متن کامل

Evaluation of the Validity and Reliability of a Communicative Scale for Translation Quality Assessment

The present study assessed the construct validity and reliability of a researcher-constructed psycho-motor mechanism scale based on the communicative theory of translation proposed by PACTE (2003). In doing so, the necessary criteria for designing the scale were obtained by a thorough review of related literature on previously constructed scales in error analysis or holistic ones. Moreover, in ...

متن کامل

Evaluation of the Validity and Reliability of a Communicative Scale for Translation Quality Assessment

The present study assessed the construct validity and reliability of a researcher-constructed psycho-motor mechanism scale based on the communicative theory of translation proposed by PACTE (2003). In doing so, the necessary criteria for designing the scale were obtained by a thorough review of related literature on previously constructed scales in error analysis or holistic ones. Moreover, in ...

متن کامل

Evaluation of the Validity and Reliability of a Communicative Scale for Translation Quality Assessment

The present study assessed the construct validity and reliability of a researcher-constructed psycho-motor mechanism scale based on the communicative theory of translation proposed by PACTE (2003). In doing so, the necessary criteria for designing the scale were obtained by a thorough review of related literature on previously constructed scales in error analysis or holistic ones. Moreover, in ...

متن کامل

Knowledge Required for Avoiding Lexical Errors at Machine Translation

This research aims at finding out the causes that led to wrong lexical selections in machine translation (MT) rather than categorizing lexical errors, which has been a main practice in error analysis. By manually examining and analyzing lexical errors outputted by a MT system, it suggests what knowledge would help the system reduce lexical errors. Keywords—Error analysis, causes of errors, mach...

متن کامل

A new model for persian multi-part words edition based on statistical machine translation

Multi-part words in English language are hyphenated and hyphen is used to separate different parts. Persian language consists of multi-part words as well. Based on Persian morphology, half-space character is needed to separate parts of multi-part words where in many cases people incorrectly use space character instead of half-space character. This common incorrectly use of space leads to some s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012